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TITLE 



JOURNAL 
MEDLINE 
PUBMED 
COMMENT 



LOCUS EIF2C1 7478 bp mRNA linear PRI 05-NOV-2002 

DEFINITION Homo sapiens eukaryotic translation initiation factor 2C, 1 

(EIF2C1) , mRNA. 
ACCESSION NM_012199 
VERSION NM_012199.1 GI : 6912351 

KEYWORDS 

SOURCE Homo sapiens (human) 

ORGANISM Homo sapiens 

Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi ; 
Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. 
REFERENCE 1 (bases 1 to 7478) 

AUTHORS Koesters,R., Adams, V., Betts,D., Moos,R., Schmid,M., Siermann,A w 
Hassam,S., Weitz,S., Lichter,P., Heitz,P.U., von Knebel 
Doeberitz,M. and Briner,J. 

Human eukaryotic initiation factor EIF2C1 gene: cDNA sequence, 
genomic organization, localization to chromosomal bands Ip34-p35, 
and expression 

Genomics 61 (2), 210-218 (1999) 
20005943 
10534406 

PROVISIONAL REFSEQ: This record has not yet been subject to final 
NCBI review. The reference sequence was derived from AF093097 . 1 . 
Summary: The human eukaryotic initiation factor 2C1 (EIF2C1) gene 
belongs to a multigene family in human. It is highly conserved 
during evolution, sharing about 90% identity with rabbit Eif2c and 
70% identity with plant EGOl at the amino acid level. EIF2C 
stimulates formation of a ternary complex composed of EIF2, 
initiation methionyl-tRNA ( I ) (Met-tRNA) , and GTP .[ supplied by 
OMIM] . 

Location/Qualif iers 
1. .7478 
. /organism="Homo sapiens" 
/db_xref="taxon: 9606" 
/chromosome= M l 11 
/map= n lp35-p34" 
gene 1 . . 7478 

/gene= ,, EIF2Cl n 

/note="synonyms : Q99, EIF2C, GERP95 " 
/db xref= n LocusID: 26523 " 
/ db_x re f= "MIM : 606228 " 
CDS 214.. 27 87 

/gene="EIF2Cl" 

/note="similar to Arabidopsis thaliana AGOl; Golgi 
Endoplasmic Reticulum protein 95 kDa" 
/codon_start=l 

/product="eukaryotic translation initiation factor 2C, 1" 
/protein id=" NP 036331.1 " 
/db xref="GI: 6912352" 



FEATURES 
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misc feature 



misc feature 



variation 



variation 



variation 



BASE COUNT 
ORIGIN 

1 



1750 a 



/ db_x r e f = " Lo c u s I D : 26523 " 
/db xref="MIM: 606228 " 

/ translation="MEAGPSGAAAGAYLPPLQQVFQAPRRPGIGTVGKPIKLI±ANYFE 
VDI PKIDVYHYEVDIKPDKCPRRVNREVVEYMVQHFKPQI FGDRKPVYDGKKNI YTVT 
ALPIGNERVDFEVTIPGEGKDRIFKVSIKWLAIVSWRMLHEALVSGQIPVPLESVQAL 
DVAMRHLASMRYTPVGRSFFSPPEGYYHPLGGGREVWFGFHQSVRPAMWKM4LNIDVS 
ATAFYKAQPVI E FMCE VLDI RNI DEQPKPLTDSQRVR FTKE I KGLKVEVTHCGQMKRK 
YRVCNVTRRPASHQTFPLQLESGQTVECTVAQYFKQKYNLQLKYPHLPCLQVGQEQKH 
TYLPLEVCNIVAGQRCIKKLTDNQTSTMIKAT^SAPDRQEEISRLMKNASYNLDPYI 
QEFGIKVKDDMTEVTGRVLPAPILQYGGRNRAIATPNQGVWDMRGKQFYNGIEIKVWA 
IACFAPQKQCREEVLKNFTDQLRKISKDAGMPIQGQPCFCKYAQGADSVEPMFRHLKN 
TYSGLQLIIVILPGKTPVYAEVKRVGDTLLGMATQCVQVKNWKTSPQTLSNLCLKIN 
VKLGGINNILVPHQRSAVFQQPVIFLGADVTHPPAGDGKKPSITAWGSMDAHPSRYC 
ATVRVQRPRQEIIEDLSYMVRELLIQFYKSTRFKPTRIIFYRDGVPEGQLPQILHYEL 
LAIRDACIKLEKDYQPGITYIWQKRHHTRLFCADKNERIGKSGNIPAGTTVDTNITH 
PFEFDFYLCSHAGIQGTSRPSHYYVLWDDNRFTADELQILTYQLCHTYVRCTRSVSIP 
APAYYARLVAFRARYHLVDKEHDSGEGSHISGQSNGRDPQALAKAVQVHQDTLRTMYF 
A" 

904 . . 1320 
/gene="EIF2Cl" 

/note="PAZ; Region: PAZ domain. This domain is named PAZ. 
after the proteins Piwi Argonaut and Zwille. This domain 
is found in two families of proteins that are involved in 
post-transcriptional gene silencing. These are the Piwi 
family and the Dicer family, that includes the Carpel 
factory protein. The function of the domains is unknown 
but has been suggested to mediate complex formation 
between proteins of the Piwi and Dicer families by 
hetero-dimerisation" 
/ db_x r e f = " CDD : pfam02170 " 
1756. .2658 
/gene="EIF2Cl" 

/note="Piwi; Region: Piwi domain. This domain is found in 

the protein Piwi and its relatives. The function of this 

domain is unknown" 

/ db_x ref=" CDD : pfam02171 " 

complement (894) 

/allele= ,, T H 

/allele="C" 

/ db_x r e f= " db SNP : 2296470 " 
complement (3403) 
/allele="G" 
/allele="C" 

/db xref="dbSNP: 2057606 " 
4637 

/gene="EIF2Cl" 

/allele="G" 

/allele="A" 

/db xref="dbSNP: 3767701 " 
18 63 c 



1895 g 1970 t 



actggcagct 

61 ggaggctgcg 

121 gggatcccga 

181 ccgggccgcc 

241 gcgggcgctt 

301 ggcactgtgg 

361 atcgacgtgt 

421 cgggaagtgg 

481 cctgtgtatg 



ggccgggcgc 
ggggcggcgg 
gcagcgagag 
tgacctccgc 
acctgccccc 
ggaaaccaat 
accactacga 
tggaatacat 
atggaaagaa 



tcgcagtggg 
cgcgagcggc 
tgtggggtac 
acgggtatat 
cctgcagcag 
caagctcctg 
ggtggacatc 
ggtccagcat 
gaacatttac 



agctgctgca 
cgggcttggt 
ctaggcccct 
gggatggaag 
gtgttccagg 
gccaattact 
aagccggata 
ttcaagcctc 
actgtcacag 



ggctccgcgg 
aggggagccg 
cacgctggac 
cgggaccctc 
cacctcgccg 
ttgaggtgga 
agtgtccccg 
agatctttgg 
cactgcccat 



cggcggcaac 
agcccggccc 
ttcacagtct 
gggagcagct 
gcctggcatt 
catccctaag 
tagagtcaac 
tgatcgcaag 
tggcaacgaa 
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541 cgggtcgact ttgaggtgac aatccctggg gaagggaagg atcgaatctt taaggtctcc 
601 atcaagtggc tagccattgt gagctggcga atgctgcatg aggccctggt cagcggccag 
661 atccctgttc ccttggagtc tgtgcaagcc ctggatgtgg ccatgaggca cctggcatcc 
721 atgaggtaca cccctgtggg ccgctccttc ttctcaccgc ctgagggcta ctaccacccg 
781 ctggggggtg ggcgcgaagt ctggttcggc tttcaccagt ctgtgcgccc tgccatgtgg 
841 aagatgatgc tcaacattga tgtctcagcc actgcctttt ataaggcaca gccagtgatt 
901 gagttcatgt gtgaggtgct ggacatcagg aacatagatg agcagcccaa gcccctcacg 
961 gactctcagc gcgttcgctt caccaaggag atcaagggcc tgaaggtgga agtcacccac 
1021 tgtggacaga tgaagaggaa gtaccgcgtg tgtaatgtta cccgtcgccc tgctagccat 
1081 cagacattcc ccttacagct ggagagtgga cagactgtgg agtgcacagt ggcacagtat 
1141 ttcaagcaga aatataacct tcagctcaag tatccccatc tgccctgcct acaagttggc 
1201 caggaacaaa agcataccta ccttccccta gaggtctgta acattgtggc tgggcagcgc 
1261 tgtattaaga agctgaccga caaccagacc tcgaccatga taaaggccac agctagatcc 
1321 gctccagaca gacaggagga gatcagtcgc ctgatgaaga atgccagcta caacttagat 
1381 ccctacatcc aggaatttgg gatcaaagtg aaggatgaca tgacggaggt gacagggcga 
1441 gtgctgccgg cgcccatctt gcagtacggc ggccggaacc gggccattgc cacacccaat 
'1501 cagggtgtct gggacatgcg ggggaaacag ttctacaatg ggattgagat caaagtctgg 
1561 gccatcgcct gcttcgcacc ccaaaaacag tgtcgagaag aggtgctcaa gaacttcaca 
1621 gaccagctgc ggaagatttc caaggatgcg gggatgccta tccagggtca accttgtttc 
1681 tgcaaatatg cacagggggc agacagcgtg gagcctatgt tccggcatct caagaacacc 
1741 tactcagggc tgcagctcat tattgtcatc ctgccaggga agacgccggt gtatgctgag 
1801 gtgaaacgtg tcggagatac actcttggga atggctacgc agtgtgtgca ggtgaagaac 
1861 gtggtcaaga cctcacctca gactctgtcc aacctctgcc tcaagatcaa tgtcaaactt 
1921 ggtggcatta acaacatcct agtcccacac cagcgctctg ccgtttttca acagccagtg 
1981 atattcctgg gagcagatgt tacacacccc ccagcagggg atgggaaaaa accttctatc 
2041 acagcagtgg taggcagtat ggatgcccac cccagccgat actgtgctac tgtgcgggta 
2101 cagcgaccac ggcaagagat cattgaagac ttgtcctaca tggtgcgtga gctcctcatc 
2161 caattctaca agtccacccg tttcaagcct acccgcatca tcttctaccg agatggggtg 
2221 cctgaaggcc agctacccca gatactccat tatgagctac tggccattcg tgatgcctgc 
2281 atcaaactgg aaaaggacta ccagcctggg atcacttata ttgtggtgca gaaacgccat 
2341 cacacccgcc ttttctgtgc tgacaagaat gagcgaattg ggaagagtgg taacatccca 
2401 gctgggacca cagtggacac caacatcacc cacccatttg agtttgactt ctatctgtgc 
2461 agccacgcag gcatccaggg caccagccga ccatcccatt actatgttct ttgggatgac 
2521 aaccgtttca cagcagatga gctccagatc ctgacgtacc agctgtgcca cacttacgta 
2581 cgatgcacac gctctgtctc tatcccagca cctgcctact atgcccgcct ggtggctttc 
2641 cgggcacgat accacctggt ggacaaggag catgacagtg gagaggggag ccacatatcg 
2701 gggcagagca atgggcggga cccccaggcc ctggccaaag ccgtgcaggt tcaccaggat 
2761 actctgcgca ccatgtactt cgcttgaagg cagaacgctg ttacctcact ggatagaaga 
2821 aagctttcca agccccagga gctgtgccac ccaaatccag aggaagcaag gaggagggag 
2881 gtggggtagg gaggagtgta ggatgccttg tttccttcta tagaggtggt gtaagagtgg 
2941 ggaacagggc cagcaagaca gaccaccagc cagaaatctc tgatatcaac ctcatgtccc 
3001 ccacccctca ccccatcttg tcacatctgg ccctgacccc actggaccaa aaggggcagc 
3061 actggtgccc accatacaca caggtgtctc atgtgactca cagtgctaaa gactcatgct 
3121 tgacagcttg gtaaggtcaa ctctgtagcc ctgcagacaa aagctggtta ggtttgggtt 
3181 tgatacttta gatgggaaag tgaggggctt gagaaagtgg gtgggaggag ggaaggattt 
3241 tttaggagcc ttaatcagaa aaggactaga tttgtttaag aagaaaaatg aaaccagacc 
3301 cagatcaata ttttaggata ctagatgttt taatgggttc agaatccagt ttgtaggaag 
3361 attttttaat ggttttggtt gctcctcccc cagctgccac cccccacctt acccttattc 
3421 ctctctgtcc acattttctg ccccacctta cttctcctcc ctgacagaca tccagcccct 
3481 agtaatactt aaggcactat ggcacttagc tttgaagtga cacgaccctg tcttccttcc 
3541 gcccgctggt gggtaaccag tgccttccct gtaacggtaa tgctgcagaa ctgcaacctt 
3601 ttgtaccttt ctttggggaa tggggtgggg gtgggagagg aggtagatgg ggaagaaata 
3661 ccccagaccc aacaaacctc cagccagaaa gccagctatt ttgcatttga aggaattgac 
3721 ttcctcattc attgagcttt ttaaaagatc acaacctcaa gatggttaaa atccattgac 
3781 atttgcactt tcaaacatga caagtctcgg agctgctgag atgacaggcc cctggccttt 
3841 ccacttatgc ctccttttct ccttattcct cctacctccc gccccgccca ggtctggagt 
3901 tactttcata gcatttttca ctcttggctt cttttctccc ttgatggtca agtctcttat 
3961 gtttcaatat tt;cttaactg gggtgtctta taacaaaaaa ctcttaggtc taaaatgaga 
4021 aaaaagagag aaaacaaaat gttattttta taccataact tgagtgtatt gccaaaattt 
4081 ggaaatcctt cccatgcctg atgagtttat atcccagaaa cattgagcca tcagaatgaa 
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4141 ctgtgtacct gatttgttct • ctgacctggc 
4201 gatggggtcc aggctccatc cttcctctgt 
4261 ccctcctctg cactgtcctg cactctttct 
4321 tgtcctctga ccctttggct catcctagat 
4381 ttttgctgct ccctattgct tctgtttaca 
4441 gcatgtgggt gggtggcatg gacttttttt 
4501 ttggctgtct tgcaggactg gagaaggtgg 
4561 agcaagcatc ccccctgccc tttttccttg 
4621 gggatgggga gttgcaactt cagtgtggaa 
4681 tatcctgatc tggtgatgaa gccatgatta 
4741 ctggaggaag aagggtctaa atcctggcct 
4801 agctgccctt gtatgacccg gatttgctat 
4861 gttggctaca ttgttcagca actcacaaaa 
4921 atcaggactg ttgagtaact cctcctttac 
4981 cctataggca caagcccagc tgaagaacag 
5041 tggagagcct acattcctta cacaagtgcc 
5101 cctgtccatt gccttcatat acagtctact 
5161 ttctcctggg acagtgggct ctgcatgttc 
5221 agggcactat tcctggaggg tccagtcatt 
5281 aggtggcagc ctctactccc agcaacaagt 
5341 actctctcca gttggttttc agctggggct 
5401 ttatgccatt caagaaataa aaagcaagag 
5461 ttcttactct gatttttttg taaaaagata 
5521 tgagattatt cctgtttgaa aggtggtgca 
5581 ctttggctca tgtggatttg gtttaagtgg 
5641 tgggggaccc ccagcctcat cctcttagtt 
5701 tgccttcctt ttcctcttct ctctttgcct 
5761 tgcttgcttg ctggccggcc tgcctgcctg 
5821 tgatgatgaa atctctgcat ggctgcaatg 
5881 agctccttga ctgcagaaga ccaagaacct 
5941 ctggactgcc ctcaagctta tactagagaa 
6001 tcaggagggt ttctggcatt ttccacacct 
6061 gctttttcta aatctggttc tttttctctt 
6121 ttagggttga gctatttggg tatcctgggt 
6181 aagagtgatg agaagagaat ggagagaatt 
6241 tctttgattg tttatccagt ccaacctgat 
6301 ccagggataa gcctggggct actgttgctg 
6361 ggtacctaga aatttgaaac ttccctaaaa 
6421 ctctgtggcc tcctatttag ctaagcagca 
6481 tgaataaggc cagcactcaa gatgggcagc 
6541 aggatatctg taaggctggt gggacagttt 
6601 ttggacgttt cttccccatc agggtagaaa 
6661 tggaaactac attgggggac gttattttta 
6721 tggtagctgg aataccttcc ttcttaaaat 
6781 tttactattt ggccttctaa gcagattggg 
6841 ccgacttaat aggacttgcc ttctccctgg 
6901 cttactctac tcatactgac ttagagcctc 
6961 ggggaaggaa tgagctaaaa acaaaacaga 
7021 acagaggaaa ataggaaacc ctccaagaat 
7081 tgagtccctt ggtgtagtag caataaggaa 
7141 cagcctaatt ggtatgtgat gttgcactta 
7201 ctggttttca ctttagtttc taaacttttt 
7261 atgtctctgg atccccacag ctgtgtactt 
7321 tttgtgtcct gctttgagct gtaccttgtc 
7381 taatgtacag ttccttctga agcaagcaac 
7441 ctgtgtttta taaagacaac agtggcttct 



taggtaggga gggggtggtt atcgccccaa 
gcagataata cctttttctt gctatagcct 
tgcaagtgca tctttttcct tcccctggac 
tgcagtgtgt cctgtggaca ggctggggaa 
aaaatgaatt tttcctggtt tcccactagg 
tttttttttt tttgtcttga gacatggggt 
tggttctagc ttggtctctg ttggccttga 
actgttcatt tttttcctgc cccactgctt 
tttcctcttt gaggagcctg ggcttggatc 
ctttagacct agcccaggct tggaggccag 
• gtagagttag aactaccatt tcctcccctt 
gcaaaacaat ctatcccagg ttctgttctg 
cgtagcacaa acattcatta tggagaaagc 
ttttttcctg ctggctacag catggggtgc 
aatggagggc tctgggagga ggcagctcac 
taaagagagt gatgctaaca ctccatctgc 
tcgtgttctg tcaccctttg gggaggggag 
tccacttgga tacattttgg ggctaggatc 
caccagcatt tgcaaatgtc catagggagc 
ttgtgttctc tccttttctc tctttgcctc 
tgaaatgcat ttttagccct ttgacgtggc 
aatcagcttt gggcaatgac aagaaatgag 
atttttgaga cttgaaaaat accccgacct 
tgcagatgga gaagtggtgt tggcagcaag 
tgcttcttac ccaagcttca aggaagtgct 
gggtctcttg ' ttccctttgt accactgttt 
ggcttccttt cccttttctt ctattcactc 
cctgcctgcc tgcctgcctg tctgcctatg 
atcccactgt tagctggcag ggtcaggctt 
gttccccaag cccagagatg tccacctggg 
gagcaactga cctgcccaac ttgtgtgaag 
gtccactcct tggagctggt ttctctcatt 
tacctggggc ctggcttttc tgagattgtc 
ttgagtgtta ggggatggac ataaaggaaa 
tgaataaaag gtgggaaagg agagcactgt 
ccattaggga tcgaggtgct acactggcct 
ggaacttagg cttaacataa agccgaagaa 
agctcctaat gcccacctgc tagatagctt 
gtgtttttgg atactttttt tttctgtttg 
caagggtgca ctgactatta gctggcccat 
tggacctgga atcatgtgta actaacaagg 
aatcatctca aactagccaa aaggcagttt 
tttatatatg gggcctaggc caatccagga 
ctgatcatgg cagggatatg cagggcactt 
aaggaggtat tttctggttt tcgctttcct 
gcagggagag aggctgggtt ggtgctctcc 
tggctgctgt ttgggcatcc aagaaaggga 
atgaggtggg aaagggagat tttcttcttt 
tgtgcaagta aagacatttg ttgaatgcac 
aaatgaaatt actttcctgt gcacacagtc 
gcagccatgt ggtgggcatg tgtgactact 
atccctctca agtccagcat ggatggggaa 
gtttgcattt gtttcccttt gagatttgtg 
cagtccattg tgaaattatc ccagcagctg 
atcagcagca gcagcagcag cagcacaatt 
atttctaa 



Revised: July 5, 2002. 
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DEFINITION 
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KEYWORDS 

SOURCE 
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REFERENCE 
AUTHORS 



TITLE 



JOURNAL 
MEDLINE 
PUBMED 
REFERENCE 
AUTHORS 

TITLE 
JOURNAL 

FEATURES 

source 



CDS 



AF093097 7478 bp mRNA linear PRI 16-NOV-1999 

Homo sapiens putative RNA-binding protein Q99 mRNA, complete cds . 
AF093097 

AF093097 .1 GI : 6002622 

Homo sapiens (human) 
Homo sapiens 

Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; 
Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. 
1 (bases 1 to 7478) 

Koesters,R., Adams , V . , Betts , D . , Moos,R., Schmid,M., Siermann^. , 
Hassam,S., Weitz,S., Lichter,P., Heitz,P.U., von Knebel 
Doeberitz,M. and Briner,J. 

Human eukaryotic initiation factor EIF2C1 gene: cDNA sequence, 
genomic organization, localization to chromosomal bands Ip34-p35, 
and expression 



(2), 210-218 (1999) 



Moos,R., Schmid,M., Doeberitz, M. V. K. and 



Genomics 61 
20005943 
10534406 

2 (bases 1 to 7478) 
Koesters,R., Briner,J. 
Betts, D. 

Direct Submission 

Submitted ( 18-SEP-1998 ) Surgery, University Hospital, INF 110, 
Heidelberg 69120, Germany 

Location/Qualifiers 

1..7478 

/organism="Homo sapiens" 
/db_xref="taxon: 9606" 
/ chromosome= " 1 " 
/map= n lp34-p35 n 
214. .2787 

/note= n similar to Arabidopsis thaliana AGOl" 
/codon_start=l 

/product="putative RNA-binding protein Q99" 
/protein id=" AAF00068 . 1 " 
/db_xref="GI : 6002623" 

/ translation="MEAGPSGAAAGAYLPPLQQVFQAPRRPGIGTVGKPIKLLANYFE 
VDIPKIDVYHYEVDIKPDKCPRRVNREWEYMVQHFKPQIFGDRKPVYDGKKNIYTVT 
ALPIGNERVDFEVTIPGEGKDRIFKVSIKWLAIVSWRMLHEALVSGQIPVPLESVQAL 
DVAMRHLASMRYTPVGRSFFSPPEGYYHPLGGGREVWFGFHQSVRPAMWKMMLNIDVS 
ATAFYKAQPVIEFMCEVLDIRNIDEQPKPLTDSQRVRFTKEIKGLKVEVTHCGQMKRK 
YRVCNVTRRPASHQTFPLQLESGQTVECTVAQYFKQKYNLQLKYPHLPCLQVGQEQKH 
TYLPLEVCNIVAGQRCIKKLTDNQTSTMIKATARSAPDRQEEISRLMKNASYNLDPYI 
QE FGI KVKDDMTE VTGRVL PAPI LQ YGGRNRAIAT PNQGVWDMRGKQFYNGI E I KVWA 
IACFAPQKQCREEVLKNFTDQLRKISKDAGMPIQGQPCFCKYAQGADSVEPMFRHLKN 
TYSGLQLIIVILPGKTPVYAEVKRVGDTLLGMATQCVQVKNWKTSPQTLSNLCLKIN 
VKLGGINNILVPHQRSAVFQQPVIFLGADVTHPPAGDGKKPSITAWGSMDAHPSRYC 
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ATVRVQRPRQEI IEDLSYMVRELLIQFYKSTRFKPTRI I FYRDGVPEGQLPQILHYEL 
LAIRDACIKLEKDYQPGITYIWQKRHHTRLFCADKNERIGKSGNIPAGTTVDTNITH 
PFEFDFYLCSHAGIQGTSRPSHYYVLWDDNRFTADELQILTYQLCHTYVRCTRSVSIP 
APAYYARLVAFRARYHLVDKEHDSGEGSHISGQSNGRDPQALAKAVQVHQDTLRTMYF 
A" 

BASE COUNT 1750 a 1863 c 1895 g 1970 t 

ORIGIN 

1 actggcagct ggccgggcgc tcgcagtggg agctgctgca ggctccgcgg cggcggcaac 
61 ggaggctgcg ggggcggcgg cgcgagcggc cgggcttggt aggggagccg agcccggccc 
121 gggatcccga gcagcgagag tgtggggtac ctaggcccct cacgctggac ttcacagtct 
181 ccgggccgcc tgacctccgc acgggtatat gggatggaag cgggaccctc gggagcagct 
241 gcgggcgctt acctgccccc cctgcagcag gtgttccagg cacctcgccg gcctggcatt 
301 ggcactgtgg ggaaaccaat caagctcctg gccaattact ttgaggtgga catccctaag 
361 atcgacgtgt accactacga ggtggacatc aagccggata agtgtccccg tagagtcaac 
421 cgggaagtgg tggaatacat ggtccagcat ttcaagcctc agatctttgg tgatcgcaag 
481 cctgtgtatg atggaaagaa gaacatttac actgtcacag cactgcccat tggcaacgaa 
541 cgggtcgact ttgaggtgac aatccctggg gaagggaagg atcgaatctt taaggtctcc 
601 atcaagtggc tagccattgt gagctggcga atgctgcatg aggccctggt cagcggccag 
661 atccctgttc ccttggagtc tgtgcaagcc ctggatgtgg ccatgaggca cctggcatcc 
721 atgaggtaca cccctgtggg ccgctccttc ttctcaccgc ctgagggcta ctaccacccg 
781 ctggggggtg ggcgcgaagt ctggttcggc tttcaccagt ctgtgcgccc tgccatgtgg 
841 aagatgatgc tcaacattga tgtctcagcc actgcctttt ataaggcaca gccagtgatt 
901 gagttcatgt gtgaggtgct ggacatcagg aacatagatg agcagcccaa gcccctcacg 
961 gactctcagc gcgttcgctt caccaaggag atcaagggcc tgaaggtgga agtcacccac 
1021 tgtggacaga tgaagaggaa gtaccgcgtg tgtaatgtta cccgtcgccc tgctagccat 
1081 cagacattcc ccttacagct ggagagtgga cagactgtgg agtgcacagt ggcacagtat 
1141 ttcaagcaga aatataacct tcagctcaag tatccccatc tgccctgcct acaagttggc 
1201 caggaacaaa agcataccta ccttccccta gaggtctgta acattgtggc tgggcagcgc 
1261 tgtattaaga agctgaccga caaccagacc tcgaccatga taaaggccac agctagatcc 
1321 gctccagaca gacaggagga gatcagtcgc ctgatgaaga atgccagcta caacttagat 
1381 ccctacatcc aggaatttgg gatcaaagtg aaggatgaca tgacggaggt gacagggcga 
1441 gtgctgccgg cgcccatctt gcagtacggc ggccggaacc gggccattgc cacacccaat 
1501 cagggtgtct gggacatgcg ggggaaacag ttctacaatg ggattgagat caaagtctgg 
1561 gccatcgcct gcttcgcacc ccaaaaacag tgtcgagaag aggtgctcaa gaacttcaca 
1621 gaccagctgc ggaagatttc caaggatgcg .gggatgccta tccagggtca accttgtttc 
1681 tgcaaatatg cacagggggc agacagcgtg gagcctatgt tccggcatct caagaacacc 
1741 tactcagggc tgcagctcat tattgtcatc ctgccaggga agacgccggt gtatgctgag 
1801 gtgaaacgtg tcggagatac actcttggga atggctacgc agtgtgtgca ggtgaagaac 
1861 gtggtcaaga cctcacctca gactctgtcc aacctctgcc tcaagatcaa tgtcaaactt 
1921 ggtggcatta acaacatcct agtcccacac cagcgctctg ccgtttttca acagccagtg 
1981 atattcctgg gagcagatgt tacacacccc ccagcagggg atgggaaaaa accttctatc 
2041 acagcagtgg taggcagtat ggatgcccac cccagccgat actgtgctac tgtgcgggta 
2101 cagcgaccac ggcaagagat cattgaagac ttgtcctaca tggtgcgtga gctcctcatc 
2161 caattctaca agtccacccg tttcaagcct acccgcatca tcttctaccg agatggggtg 
2221 cctgaaggcc agctacccca gatactccat tatgagctac tggccattcg tgatgcctgc 
2281 atcaaactgg aaaaggacta ccagcctggg atcacttata ttgtggtgca gaaacgccat 
2341 cacacccgcc ttttctgtgc tgacaagaat gagcgaattg ggaagagtgg taacatccca 
2401 gctgggacca cagtggacac caacatcacc cacccatttg agtttgactt ctatctgtgc 
2461 agccacgcag gcatccaggg caccagccga ccatcccatt actatgttct ttgggatgac 
2521 aaccgtttca cagcagatga gctccagatc ctgacgtacc agctgtgcca cacttacgta 
2581 cgatgcacac gctctgtctc tatcccagca cctgcctact atgcccgcct ggtggctttc 
2641 cgggcacgat accacctggt ggacaaggag catgacagtg gagaggggag ccacatatcg 
2701 gggcagagca atgggcggga cccccaggcc ctggccaaag ccgtgcaggt tcaccaggat 
2761 actctgcgca ccatgtactt cgcttgaagg cagaacgctg ttacctcact ggatagaaga 
2821 aagctttcca agccccagga gctgtgccac ccaaatccag aggaagcaag gaggagggag 
2881 gtggggtagg gaggagtgta ggatgccttg tttccttcta tagaggtggt gtaagagtgg 
2941 ggaacagggc cagcaagaca gaccaccagc cagaaatctc tgatatcaac ctcatgtccc 
3001 ccacccctca ccccatcttg tcacatctgg ccctgacccc actggaccaa aaggggcagc 
3061 actggtgccc accatacaca caggtgtctc atgtgactca cagtgctaaa gactcatgct 
3121 tgacagcttg gtaaggtcaa ctctgtagcc ctgcagacaa aagctggtta ggtttgggtt 
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3181 tgatacttta gatgggaaag tgaggggctt 
3241 tttaggagcc ttaatcagaa aaggactaga 
3301 cagatcaata ttttaggata ctagatgttt 
3361 attttttaat ggttttggtt gctcctcccc 
3421 ctctctgtcc acattttctg ccccacctta 
3481 agtaatactt aaggcactat ggcacttagc 
3541 gcccgctggt gggtaaccag tgccttccct 
3601 ttgtaccttt ctttggggaa tggggtgggg 
3661 ccccagaccc aacaaacctc cagccagaaa 
3721 ttcctcattc attgagcttt ttaaaagatc 
3781 atttgcactt tcaaacatga caagtctcgg 
3841 ccacttatgc ctccttttct ccttattcct 
3901 tactttcata gcatttttca ctcttggctt 
3961 gtttcaatat ttcttaactg gggtgtctta 
4021 aaaaagagag aaaacaaaat gttattttta 
4081 ggaaatcctt cccatgcctg atgagtttat 
4141 ctgtgtacct gatttgttct ctgacctggc 
4201 gatggggtcc aggctccatc cttcctctgt 
4261 ccctcctctg cactgtcctg cactctttct 
4321 tgtcctctga ccctttggct catcctagat 
4381 ttttgctgct ccctattgct tctgtttaca 
4441 gcatgtgggt gggtggcatg gacttttttt 
4501 ttggctgtct tgcaggactg gagaaggtgg 
4561 agcaagcatc ccccctgccc tttttccttg 
4621 gggatgggga gttgcaactt cagtgtggaa 
4681 tatcctgatc tggtgatgaa gccatgatta 
4741 ctggaggaag aagggtctaa atcctggcct 
4801 agctgccctt gtatgacccg gatttgctat 
4861 gttggctaca ttgttcagca actcacaaaa 
4921 atcaggactg ttgagtaact cctcctttac 
4981 cctataggca caagcccagc tgaagaacag 
5041 tggagagcct acattcctta cacaagtgcc 
5101 cctgtccatt gccttcatat acagtctact 
5161 ttctcctggg acagtgggct ctgcatgttc 
5221 agggcactat tcctggaggg tccagtcatt 
5281 aggtggcagc ctctactccc agcaacaagt 
5341 actctctcca gttggttttc agctggggct 
5401 ttatgccatt caagaaataa aaagcaagag 
5461 ttcttactct gatttttttg taaaaagata 
5521 tgagattatt cctgtttgaa aggtggtgca 
5581 ctttggctca tgtggatttg gtttaagtgg 
5641 tgggggaccc ccagcctcat cctcttagtt 
5701 tgccttcctt ttcctcttct ctctttgcct 
5761 tgcttgcttg ctggccggcc tgcctgcctg 
5821 tgatgatgaa atctctgcat ggctgcaatg 
5881 agctccttga ctgcagaaga ccaagaacct 
5941 ctggactgcc ctcaagctta tactagagaa 
6001 tcaggagggt ttctggcatt ttccacacct 
6061 gctttttcta aatctggttc tttttctctt 
6121 ttagggttga gctatttggg tatcctgggt 
6181 aagagtgatg agaagagaat ggagagaatt 
6241 tctttgattg tttatccagt ccaacctgat 
6301 ccagggataa gcctggggct actgttgctg 
6361 ggtacctaga aatttgaaac ttccctaaaa 
6421 ctctgtggcc tcctatttag ctaagcagca 
6481 tgaataaggc cagcactcaa gatgggcagc 
6541 aggatatctg taaggctggt gggacagttt 
6601 ttggacgttt cttccccatc agggtagaaa 
6661 tggaaactac attgggggac gttattttta 
6721 tggtagctgg aataccttcc ttcttaaaat 



gagaaagtgg gtgggaggag ggaaggattt 
tttgtttaag aagaaaaatg aaaccagacc 
taatgggttc agaatccagt ttgtaggaag 
cagctgccac cccccacctt acccttattc 
cttctcctcc ctgacagaca tccagcccct 
tttgaagtga cacgaccctg tcttccttcc 
gtaacggtaa tgctgcagaa ctgcaacctt 
gtgggagagg aggtagatgg ggaagaaata 
gccagctatt ttgcatttga aggaattgac 
acaacctcaa gatggttaaa atccattgac 
agctgctgag atgacaggcc cctggccttt 
cctacctccc gccccgccca ggtctggagt 
cttttctccc ttgatggtca agtctcttat. 
taacaaaaaa ctcttaggtc taaaatgaga 
taccataact tgagtgtatt gccaaaattt 
atcccagaaa cattgagcca tcagaatgaa 
taggtaggga gggggtggtt atcgccccaa 
gcagataata cctttttctt gctatagcct 
tgcaagtgca tctttttcct tcccctggac 
tgcagtgtgt cctgtggaca ggctggggaa 
aaaatgaatt tttcctggtt tcccactagg 
tttttttttt tttgtcttga gacatggggt 
tggttctagc ttggtctctg ttggccttga 
actgttcatt tttttcctgc cccactgctt 
tttcctcttt gaggagcctg ggcttggatc 
ctttagacct agcccaggct tggaggccag 
gtagagttag aactaccatt ■ tcctcccctt 
gcaaaacaat ctatcccagg ttctgttctg 
cgtagcacaa acattcatta tggagaaagc 
ttttttcctg ctggctacag catggggtgc 
aatggagggc tctgggagga ggcagctcac 
taaagagagt gatgctaaca ctccatctgc 
tcgtgttctg tcaccctttg gggaggggag 
tccacttgga tacattttgg ggctaggatc 
caccagcatt tgcaaatgtc catagggagc 
ttgtgttctc tccttttctc tctttgcctc 
tgaaatgcat ttttagccct ttgacgtggc 
aatcagcttt gggcaatgac aagaaatgag 
atttttgaga cttgaaaaat accccgacct 
tgcagatgga gaagtggtgt tggcagcaag 
tgcttcttac ccaagcttca aggaagtgct 
gggtctcttg ttccctttgt accactgttt 
ggcttccttt cccttttctt ctattcactc 
cctgcctgcc tgcctgcctg tctgcctatg 
atcccactgt tagctggcag ggtcaggctt 
gttccccaag cccagagatg tccacctggg 
gagcaactga cctgcccaac ttgtgtgaag 
gtccactcct tggagctggt ttctctcatt 
tacctggggc ctggcttttc tgagattgtc 
ttgagtgtta ggggatggac ataaaggaaa 
tgaataaaag gtgggaaagg agagcactgt 
ccattaggga tcgaggtgct acactggcct 
ggaacttagg cttaacataa agccgaagaa 
agctcctaat gcccacctgc tagatagctt 
gtgtttttgg atactttttt tttctgtttg 
caagggtgca ctgactatta gctggcccat 
tggacctgga atcatgtgta actaacaagg 
aatcatctca aactagccaa aaggcagttt 
tttatatatg gggcctaggc caatccagga 
ctgatcatgg cagggatatg cagggcactt 
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6781 tttactattt ggccttctaa gcagattggg aaggaggtat tttctggttt tcgctttcct 

6841 ccgacttaat aggacttgcc ttctccctgg gcagggagag aggctgggtt ggtgctctcc 

6901 cttactctac tcatactgac ttagagcctc tggctgctgt ttgggcatcc aagaaaggga 

6961 ggggaaggaa tgagctaaaa acaaaacaga atgaggtggg aaagggagat tttcttcttt 

7021 acagaggaaa ataggaaacc ctccaagaat tgtgcaagta aagacatttg ttgaatgcac 

7081 tgagtccctt ggtgtagtag caataaggaa aaatgaaatt actttcctgt gcacacagtc 

7141 cagcctaatt ggtatgtgat gttgcactta gcagccatgt ggtgggcatg tgtgactact 

7201 ctggttttca ctttagtttc taaacttttt atccctctca agtccagcat ggatggggaa 

7261 atgtctctgg atccccacag ctgtgtactt gtttgcattt gtttcccttt gagatttgtg 

•7321 tttgtgtcct gctttgagct gtaccttgtc cagtccattg tgaaattatc ccagcagctg 

7381 taatgtacag ttccttctga agcaagcaac atcagcagca gcagcagcag cagcacaatt 

7441 ctgtgtttta taaagacaac agtggcttct atttctaa 

If 
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